A high-dimensional Wilks phenomenon

Authors

  • Stéphane Boucheron
  • Pascal Massart
Abstract

A theorem by Wilks asserts that in smooth parametric density estimation the difference between the maximum likelihood and the likelihood of the sampling distribution converges toward a chi-square distribution where the number of degrees of freedom coincides with the model dimension. This observation is at the core of some goodness-of-fit testing procedures and of some classical model selection methods. This paper describes a nonasymptotic version of the Wilks phenomenon in bounded contrast optimization procedures. Using concentration inequalities for general functions of independent random variables, it proves that in bounded contrast minimization (as for example in Statistical Learning Theory), the difference between the empirical risk of the minimizer of the true risk in the model and the minimum of the empirical risk (the excess empirical risk) satisfies a Bernstein-like inequality where the variance term reflects the dimension of the model and the scale term reflects the noise conditions. From a mathematical statistics viewpoint, the significance of this result comes from the recent observation that when using model selection via penalization, the excess empirical risk represents a minimum penalty if non-asymptotic guarantees concerning prediction error are to be provided. From the perspective of empirical process theory, this paper describes a concentration inequality for the supremum of a bounded noncentered (actually non-positive) empirical process. Combining the now classical analysis of M-estimation (building on Talagrand’s inequality for suprema of empirical processes) and versatile moment inequalities for functions of independent random variables, this paper develops a genuine Bernstein-like inequality that seems beyond the reach of traditional tools. 
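The classical statement recalled at the start of the abstract can be checked numerically. The sketch below is not taken from the paper: it assumes a multinomial model with k equally likely cells (model dimension k - 1), and the function names and parameter values are illustrative choices. It computes twice the gap between the maximized log-likelihood and the log-likelihood of the sampling distribution, and compares the empirical mean and variance with those of a chi-square distribution with k - 1 degrees of freedom.

```python
import math
import random
from collections import Counter


def log_likelihood_ratio(counts, probs, n):
    """Twice the gap between the maximized and the true multinomial log-likelihood."""
    stat = 0.0
    for i, p in enumerate(probs):
        c = counts.get(i, 0)
        if c > 0:  # a cell with zero count contributes nothing (0 * log 0 = 0)
            stat += c * math.log(c / (n * p))
    return 2.0 * stat


def simulate(k=5, n=500, reps=2000, seed=0):
    """Draw `reps` samples of size n from a uniform k-cell multinomial (assumed setup)."""
    rng = random.Random(seed)
    probs = [1.0 / k] * k
    stats = []
    for _ in range(reps):
        counts = Counter(rng.choices(range(k), weights=probs, k=n))
        stats.append(log_likelihood_ratio(counts, probs, n))
    return stats


stats = simulate()
mean = sum(stats) / len(stats)
var = sum((s - mean) ** 2 for s in stats) / (len(stats) - 1)
# A chi-square variable with d degrees of freedom has mean d and variance 2d; here d = 4.
print(f"empirical mean     = {mean:.2f} (chi-square reference: 4)")
print(f"empirical variance = {var:.2f} (chi-square reference: 8)")
```

This only illustrates the asymptotic parametric statement. The nonasymptotic Bernstein-like inequality for the excess empirical risk described in the abstract is a different and stronger kind of result, and the simulation does not verify it.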
Supported by ANR grant TAMIS and by the Network of Excellence PASCAL II.

Stéphane Boucheron: Laboratoire Probabilités et Modèles Aléatoires, Université Paris-Diderot, 175 rue du Chevaleret, 75013-F Paris

Pascal Massart: Département de Mathématiques, Université Paris-Sud, 91405-F Orsay

Similar resources

Comments on: Nonparametric inference with generalized likelihood ratio tests

This is a very interesting paper reviewing the technique for testing semiparametric hypotheses using GLR tests. I’d like to supplement Fan and Jiang’s review with some cautions and a somewhat different point of view. 1 The Wilks phenomenon Rigorous results for smooth parametric models, see, for example, Bickel and Doksum (2005, Chap. 6) do say that, if θ̂ , η̂ or, equivalently, (θ̂η̂, η̂) are MLE’s,...

Nonparametric Inferences for Additive Models

Additive models with backfitting algorithms are popular multivariate nonparametric fitting techniques. However, the inferences of the models have not been very well developed, due partially to the complexity of the backfitting estimators. There are few tools available to answer some important and frequently asked questions, such as whether a specific additive component is significant or admits ...

Scaled laboratory experiments explain the kink behaviour of the Crab Nebula jet

The remarkable discovery by the Chandra X-ray observatory that the Crab nebula's jet periodically changes direction provides a challenge to our understanding of astrophysical jet dynamics. It has been suggested that this phenomenon may be the consequence of magnetic fields and magnetohydrodynamic instabilities, but experimental demonstration in a controlled laboratory environment has remained e...

Generalized Likelihood Ratio Statistics and Wilks Phenomenon

The likelihood ratio theory contributes tremendous success to parametric inferences. Yet, there is no general applicable approach for nonparametric inferences based on function estimation. Maximum likelihood ratio test statistics in general may not exist in nonparametric function estimation setting. Even if they exist, they are hard to find and can not be optimal as shown in this paper. We intr...

Wilks' phenomenon and penalized likelihood-ratio test for nonparametric curve registration

The problem of curve registration appears in many different areas of applications ranging from neuroscience to road traffic modeling. In the present work, we propose a nonparametric testing framework in which we develop a generalized likelihood ratio test to perform curve registration. We first prove that, under the null hypothesis, the resulting test statistic is asymptotically distributed as ...

Publication date: 2010